A time-delay neural network architecture for isolated word recognition
نویسندگان
چکیده
-A translation-invariant back-propagation network is described that performs better than a soph&ticated continuous acoustic parameter hidden Markov model on a noisy, lO0-speaker confusable vocabulary isolated word recognition task. The network's replicated architecture permits it to extract precise information from unaligned training patterns selected by a naive segmentation rule. Keywords--Isolated word recognition, Network architecture, Constrained links, Time delays, Multiresolution learning, Multispeaker speech recognition, Neural networks.
منابع مشابه
A Time-derivative Neural Net Architecture - an Alternative to the Time-delay Neural Net Architecture
Though the time-delay neural net architecture has been recently used in a number of speech recognition applications, it has the problem that it can not use longer temporal contexts because this increases the number of connection weights in the network. This is a serious bottleneck because the use of larger temporal contexts can improve the recognition performance. In this paper, a time-derivari...
متن کاملMulti-State Time Delay Neural Networks for Continuous Speech Recognition
Alex Waibel Carnegie Mellon University Pittsburgh, PA 15213 [email protected] We present the "Multi-State Time Delay Neural Network" (MS-TDNN) as an extension of the TDNN to robust word recognition. Unlike most other hybrid methods. the MS-TDNN embeds an alignment search procedure into the connectionist architecture. and allows for word level supervision. The resulting system has the ability to ma...
متن کاملA connectionist recognizer for on-line cursive handwriting recognition
In this paper we show how the Multi-State Time Delay Neural Network (MS-TDNN), which is already used successfully in continuous speech recognition tasks, can be applied both to online single character and cursive (continuous) handwriting recognition. The MS-TDNN integrates the high accuracy single character recognition capabilities of a TDNN with a non-linear time alignment procedure (dynamic t...
متن کاملPerformance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition
Connectionist Rpeech recognition systems are often handicapped by an inconsistency between training and testing criteria. This problem is addressed by the Multi-State Time Delay Neural Network (MS-TDNN), a hierarchical phonf'mp and word classifier which uses DTW to modulate its connectivit.y pattern, and which is directly trained on word-level targets. The consistent use of word accuracy as a c...
متن کاملA Comparative Study of the Multi-Layer Perceptron, the Multi-Output Layer Perceptron, the Time-Delay Neural Network and the Kohonen Self-Organising Map in an Automatic Speech Recognition Task
This paper describes a study of the use of four different neural network techniques for automatic speech recognition (ASR) using two common, real-world application databases. The neural network techniques investigated were the Multi-Layer Perceptron (MLP), the Multi-Output-Layer Perceptron (MOLP), which is an improved version of the MLP , the Time-Delay Neural Network (TDNN) and the Kohohnen Se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Neural Networks
دوره 3 شماره
صفحات -
تاریخ انتشار 1990